Multilabel Prototype Generation for data reduction in K-Nearest Neighbour classification
نویسندگان
چکیده
• Multilabel Prototype Generation for efficient k-Nearest Neighbour classification Four multiclass methods are adapted to the multilabel space Evaluation with twelve corpora, three noise scenarios, and different classifiers Proposals significantly improve performance reduction rates of reference strategies Novel adaptations proposed also show significant capabilities (PG) typically considered improving efficiency k -Nearest ( NN) classifier when tackling high-size corpora. Such approaches aim at generating a reduced version corpus without decreasing compared initial set. Despite their large application in very few works have addressed proposal PG space. In this regard, work presents novel adaptation four case. These proposals evaluated NN-based classifiers, 12 corpora comprising varied range domains sizes, scenarios artificially induced data. The results obtained that capable improving—both terms performance—the only literature as well case which no method is applied, presenting statistically superior robustness noisy scenarios. Moreover, these allow prioritising either or efficacy criteria through its configuration depending on target scenario, hence covering wide area solution not previously filled by other works.
منابع مشابه
Introduction to k Nearest Neighbour Classification and Condensed Nearest Neighbour Data Reduction
Suppose a bank has a database of people’s details and their credit rating. These details would probably be the person’s financial characteristics such as how much they earn, whether they own or rent a house, and so on, and would be used to calculate the person’s credit rating. However, the process for calculating the credit rating from the person’s details is quite expensive, so the bank would ...
متن کاملLove Thy Neighbour: Automatic Animal Behavioural Classification of Acceleration Data Using the K-Nearest Neighbour Algorithm
Researchers hoping to elucidate the behaviour of species that aren't readily observed are able to do so using biotelemetry methods. Accelerometers in particular are proving particularly effective and have been used on terrestrial, aquatic and volant species with success. In the past, behavioural modes were detected in accelerometer data through manual inspection, but with developments in techno...
متن کاملEfficient model selection for probabilistic K nearest neighbour classification
ProbabilisticK-nearest neighbour (PKNN) classification has been introduced to improve the performance of the original K-nearest neighbour (KNN) classification algorithm by explicitly modelling uncertainty in the classification of each feature vector. However, an issue common to both KNN and PKNN is to select the optimal number of neighbours, K. The contribution of this paper is to incorporate t...
متن کاملExtensions of the k Nearest Neighbour Methods for Classification Problems
The k Nearest Neighbour (kNN) method is a widely used technique which has found several applications in clustering and classification. In this paper, we focus on classification problems and we propose modifications of the nearest neighbour method that exploit information from the structure of a dataset. The results of our experiments using datasets from the UCI repository demonstrate that the c...
متن کاملk-Nearest Neighbour Classifiers
Perhaps the most straightforward classifier in the arsenal or machine learning techniques is the Nearest Neighbour Classifier – classification is achieved by identifying the nearest neighbours to a query example and using those neighbours to determine the class of the query. This approach to classification is of particular importance today because issues of poor run-time performance is not such...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Pattern Recognition
سال: 2023
ISSN: ['1873-5142', '0031-3203']
DOI: https://doi.org/10.1016/j.patcog.2022.109190